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(54) Title: LIGATION-BASED SYNTHESIS OF OLIGONUCLEOTIDES WITH BLOCK STRUCTURE 

(57) Abstract: The present invention relates to a method of producing single-stranded nucleic acid molecules from oligo- or polynu- 
cleotides wherein each of said oligo- or polynucleotides has a predefined 5' or 3' terminus, comprising the steps of (a) annealing an 
adaptor oligonucleotide simultaneously or step by step to (aa) a first oligo- or polynucleotide; and (ab) a second oligo- or polynu- 
cleotide wherein the 5-terminus of said adaptor oligonucleotide is complementary in sequence to the 5' terminus of said first oligo- 
or polynucleotide and the 3'terminus of said adaptor molecule is complementary in sequence to the 3* terminus of said second oligo- 
or polynucleotide; and optionally (a') simultaneously with or subsequently to step (a) annealing at least one further adaptor oligonu- 
cleotide to free termini of said first or second oligonucleotides and to free termini of further oligo- or polynucleotides; (b) optionally 
filling in gaps between the neighbouring ends of said oligo- or polynucleotides; (c) ligating said oligo- or polynucleotides; and (d) 
removing said at least one adaptor oligonucleotide. In a preferred embodiment of the method of the invention, said single-stranded 
nucleic acid molecules represent a collection of nucleic acid molecules wherein either said first or said second oligo- or polynu- 
cleotide is invariable in sequence between all members of said collection of nucleic acid molecules. 
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Ligation-based synthesis of oligonucleotides with block structure 

The present invention relates to a method of producing single-stranded nucleic 
acid molecules from oligo- or polynucleotides wherein each of said oligo- or 
polynucleotides has a predefined 5' or 3' terminus, comprising the steps of (a) 
annealing an adaptor oligonucleotide simultaneously or step by step to (aa) a 
first oligo- or polynucleotide; and (ab) a second oligo- or polynucleotide wherein 
the S'-terminus of said adaptor oligonucleotide is complementary in sequence to 
the 5* terminus of said first oligo- or polynucleotide and the 3'-terminus of said 
adaptor molecule is complementary in sequence to the 3' terminus of said 
second oligo- or polynucleotide; and optionally (a') simultaneously with or 
subsequently to step (a) annealing at least one further adaptor oligonucleotide to 
free termini of said first or second oligonucleotides and to free termini of further 
oligo- or polynucleotides; (b) optionally filling in gaps between the neighbouring 
ends of said oligo- or polynucleotides; (c) ligating said oligo- or polynucleotides; 
and (d) removing said at least one adaptor oligonucleotide. In a preferred 
embodiment of the method of the invention, said single-stranded nucleic acid 
molecules represent a collection of nucleic acid molecules wherein either said 
first or said second oligo- or polynucleotide is invariable in sequence between all 
members of said collection of nucleic acid molecules. 



The invention is particularly efficient for the synthesis of long polynucleotides or 
sets of oligonucleotides with block structure. 

As is known in the art, oligonucleotide-producing companies cannot guarantee a 
quantitative yield for oligos longer than 100 nucleotides (nt). Moreover, the yield 
and quality of the synthesis decrease dramatically when the length of 
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oligonucleotide is more than 60 nt. The smallest possible scale of synthesis for 
80-100mers is more then 200 nmol. Two-step purification (HPLC and PAGE) is ■ 
required to obtain single-band oligonucleotides. The guaranteed output of 
purified 80-100mers is less than 1 nmol and the price is about 200-300 Euro. 



Oligonucleotides with block structure are widely used in molecular biology 
(Figure 1). Examples are: padlock probes (Lizardi et a|. 1998; Pickering et al. 
20Q2); primers with constant 5' regions, u?ed for multiplex PGR amplification 
(Favis et aL 2000; Liridblad-Toh et al. 2000), ligase-jndependent cloning (de 
.Costa and Tanuri 1998; Rashtchian et aL 1992; Zhou and Hatahet 1995; and 
commercial kits from Novagen, Invitrpgen, BP Biosciences) and Invader assay 
(Mein et al. 2000). 

Normally, these primers are synthesized by phosphoramidite . technology and 
common regions have to be synthesized again and again in different 
oligonucleotides. It is expensive, especially, when the common part contains a 
hapten or fluorpphore. 

This problem becomes evident, for example, in the preparation of sets of padlock 
probes for SNP-detection projects. Padlock probes are typically 90-1 20nt long 
oligonucleotides, which consist of two locus-specific regions on both 3' and 5' 
ends connected by universal linker part (Figure 1 A). The high price and the low 
yield of synthesis are the main obstacles for routine usage of padlock probes. 
Though they were shown to be an excellent tool for SNP detection and in situ 
localization, only few laboratories work with padlocks until now. Accordingly, the 
technical problem underlying the present invention , was to provide methods for 
the quantitative and cost-sensitive production of single-stranded nucleic acid 
molecules that can in particular be employed as padlock probes. 



WO 2004/092375 



PCT/EP2004/003921 



3 

The solution to said technical problem is achieved by providing the embodiments 
characterized in the claims. Thus, the present invention relates io a method of 
producing single-stranded nucleic acid molecules from oligo- or polynucleotides 
wherein each of said oligo- or polynucleotides has a predefined 5' or 3' terminus, 
comprising the steps of (a) annealing an adaptor oligonucleotide simultaneously 
or step by step to (aa) a first oligo- or polynucleotide; and (ab) a second oligo- or 
polynucleotide wherein the S'-terminus of said adaptor oligonucleotide is 
complementary in sequence to the 5' terminus of said first oligo- or 
polynucleotide and the 3-terminus of said adaptor molecule is complementary in 
sequence to the 3' terminus of said second oligd- or polynucleotide; and 
optionally (a') simultaneously with or subsequently to step (a) annealing at. least 
one further adaptor oligonucleotide to free termini of said first or second 
oligonucleotides and to free termini of further . oligo- or polynucleotides; (b) 
optionally filling in gaps between the neighboring ends of said oligo- or 
polynucleotides; (c) ligating said oligo- or polynucleotides; and (d) removing said 
at least one adaptor oligonucleotide. 

In accordance with the present invention, the term "oligonucleotide" refers to a . 
uni-dimensional (i.e. not branched) stretch of nucleotides, preferably 
deoxyribonucleotides up to 30 nucleotides. The term also comprises 
oligonucleotides comprising or consisting totally, of ribonucleotides. Also 
envisaged is that the oligonucleotides comprise unusual nucleotides such as 
unusual nucleotides as, for example, deoxyuridine, biotinylated or fluorescently 
labeled nucleotides, spacers or abasic residues. It is preferred that the 
oligonucleotide employed in the method of the invention consists of the four 
naturally occurring deoxyribonucleotides, i.e. adenine, cytosine/ guanine and 
tymidine. 

The term "polynucleotide" in accordance with the invention may consist of the 
same types of nucleotides that are described above for oligonucleotides. 
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However, a polynucleotide in accordance with the invention comprises a 
unidimensional stretch of at least 31 nucleotides. 

The term "S'-terminus" refers to the 5'-terrninal part of an oligo- or. polynucleotide, 
preferably the terminal 5 or 4 nucleotides. 

The term "complementary in sequence" refers to complementarity in sequence of 
at least 75% of the respective nucleotides, preferably at least 90% of the 
respective nucleotides and most preferred 100% of the respective nucleotides. 

In accordance with the present invention, a novel method of producing 
oligonucleotides by ligation of individual fragments by a ligase such as T4 DNA 
ligase is described. It is simple and allows the simultaneous processing of 
several reactions. The method is quantitative, cheap and does not require 
individual optimization. The possibility to purify products by HPLC makes ' the 
technology suitable for large-scale genomic projects. On the other hand, the 
same approach may be used for small-scale synthesis of composite primers for 
two-step PCR amplification and ligation-independent cloning. . Small-scale 
reaction does not require any purification. 

The method of the invention requires oligo- or polynucleotides having a 
predefined 5' or 3' terminus to which an adaptor polynucleotide, which is 
complementary in sequence to, said predefined 5' or 3' terminus is annealed. 
Simultaneously or subsequently, the second oligo- or polynucleotide is annealed 
to said adapter oligonucleotide byway of complementarity of its 5' or 3' terminus. 
A schematic overview over the annealing process is provided in Fig. 2B wherein 
the first oligo- or polynucleotide may be represented by #R, the second oligo- or 
polynucleotide may be represented by #C and the adaptor oligonucleotide may 
be represented by #aR. Alternatively, the first oligonucleotide or polynucleotide 
may be represented by #C, the second oligonucleotide or polynucleotide may be 
represented by #L and the adaptor oligonucleotide may be represented by #aL. 
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The situation including optional step (a 1 ) is represented by the complete 
arrangement of oligonucleotides depicted in Fig. 2B. For example, if #R 
represents the first oligo- or polynucleotide and #C represents the second oligo- 
or polynucleotide and #aR represents the adaptor oligonucleotide, then #aL 
represents the further adaptor oligonucleotide and #L represents the further 
oligo- or polynucleotide. 

If gaps are obtained after annealing of the adaptor oligonucleotide(s) to said first, 
second and optionally further oligo- or polynucleotide(s) then the gaps are filled 
in, for example, by polymerase activity such as T4 DNA polymerase activity. 
Subsequently, the at least two oligo- or polynucleotides are ligated using an 
appropriate ligase. Appropriate ligases depend, inter alia, on the nature of the 
oligo- or polynucleotides used for preparation of the single-stranded nucleic acid 
molecules. For example, if the oligo- or polynucleotides are DNA, than it is 
preferred to use the T4 DNA ligase. Other ligases may also be used, for example 
thermostable commercially available Tth, Taq or Pfu ligases. Another possibility 
to perform ligation is a chemical template-dependent reaction (Xu and Kool 

1999) , which uses chemically activated oligonucleotides instead of enzyme. . 

Finally, the at least one adaptor oligonucleotide is removed. Removal can be 
effected by denaturated PAGE or chromatography, such as FPLC or HPLC. 
Other methods are known in the prior art comprising capture of biotine labeled 
adaptors or destruction of ribonucleotide adaptors by RNase (Nilsson et al. 

2000) . 

The above steps performed in accordance with the present invention per se can 
be effected by the person skilled in the art according to conventional protocols 
such as are provided in the appended examples. Temperature ranges include 4 
to 42°C for annealing, fill-in reactions and ligation. 
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Reaction buffers include conventional reactions, buffers such as disclosed, for 
example, in (Sambrook and Russell 2001). 

Preferred is a method of the invention wherein the complementarity in sequence 
is at least four nucleotides such as 5, 6, 7, 8, 9 or 10 nucleotides. It is particularly 
preferred that the number of nucleotides which are complementary in sequence 
is five nucleotides. Also particularly preferred is that there is no mismatch within 
the stretch of complementarity. 

In another preferred embodiment the invention relates to a method wherein 
annealing and ligation are simultaneously performed. Buffers can easily be 
adjusted to have annealing and ligation performed simuftaneously. If these steps 
are performed simultaneously, then it is preferred that optional step (b) is 
omitted. The method of the invention can in this way be accelerated. 

It is also preferred in accordance with the method of the present invention that 
the most valuable oligo- or polynucleotide in step (a) and/or (a') is provided in 
molar deficit relative of other oligo- and polynucleotides. The molar deficiency will 
guarantee that said oligo- or polynucleotide is consumed in the ligation reaction. 
The term "most valuable oligo- or polynucleotide" with respect to the present 
invention refers to invention refers to either (i) the most expensive oligonucleotide 
(labeled by hapten or fluorophore or the longest oligonucleotide) or (ii) 
oligonucleotide available in less quantity if compared with others. 

In another preferred embodiment, the present invention relates to a method, 
wherein said single-stranded nucleic acid molecules represent a collection of 
nucleic acid molecules and wherein either said first or said second oligo- or 
polynucleotide is invariable in sequence between all or essentially all members 
of said collection of nucleic acid molecules. 
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The term Essentially all members 0 refers to at least 90%, preferably at least 
95%, more preferred at least 98% and most preferred to at least 99% such as 
99.5% or 99.8% of all members. 

This advantageous embodiment of the invention relates to, in other terms, a 
method of producing a collection of single-stranded nucleic acid molecules 
wherein each member of said collection of nucleic acid molecules comprises a 
portion that is invariable between all or essentially all members of said collection 
and at least one portion that is variable between different members of said 
collection and that is located 5* or 3' of said invariable portion, comprising the 
steps of (a) annealing at least one adaptor oligonucleotide simultaneously or 
step by step to (aa) an oligo- or polynucleotide representing said invariable 
portion; and (ab) oligo- or polynucleotides representing said variable portions, 
wherein (i) a first part of said at least one adaptor oligonucleotide is 
complementary in sequence to the 5' terminus of said nucleic acid molecule 
representing said invariable portion and a second part of the at least one adaptor 
molecule is complementary in sequence to the 3' terminus of a nucleic acid 
molecule representing said variable portion; or (ii) a first part of said at least one 
adaptor oligonucleotide is complementary in sequence to the 3' terminus of said 
nucleic acid molecule representing said invariable portion and a .second part of 
the at least one adaptor molecule is complementary in sequence to the 5' 
terminus of a nucleic acid molecule representing said variable portion; (b) 
optionally filling in gaps between the neighbouring ends of said invariable and 
said variable portions; (c) ligating the invariable and variable portions; and (d) 
removing said at least one adaptor oligonucleotide. In accordance with this 
preferred embodiment, it is further particularly preferred that said nucleic acid 
molecule representing said invariable portion is annealed with two adapter 
oligonucleotides, wherein further one of said adapter oligonucleotides is in a first 
part complementary in sequence with the 5' end of said nucleic acid molecule 
representing said invariable portion and the second adapter oligonucleotide is in 
a first part complementary to the 3' end of said nucleic acid molecule 
representing said invariable portion. In this embodiment, the respective termini of 
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the adaptor polynucleotides not annealed to said invariable portion are annealed 
to termini of oligo- or polynucleotides representing variable portions of the single- 
stranded nucleic acid molecule. A schematic overview of such an arrangement is 
provided in Fig. 2B. 

This embodiment of the method of the invention is particularly advantageous in 
the cost-sensitive and easy production, for example, padlock probes. It is also 
advantageous to use resulting single-stranded nucleic acid molecules in two-step 
PCR or ligase-independent cloning as will be discussed further below. 

In principle, the nucleic add molecules representing the variable portions may 
have at least one conserved terminus, namely the terminus that anneals to the 
adaptor oligonucleotide. In this case adaptor oligonucleotides may be essentially 
the same for the whole collection. Alternatively, the oligo- or polynucleotides 
representing variable portions may be without any conservative parts. Then the 
special adaptor oligonucleotide should be used for annealing of each particular 
nucleic acid molecule representing the variable portion. The terminus of said 
special adaptor oligonucleotide not annealed to the nucleic acid molecules 
representing the invariable portion must be predefined in order to allow a 
successful annealing reaction. 

Most preferred is a method of the invention wherein the further oligo- or 
polynucleotides are variable in sequence between different members of said 
collection of nucleic acid molecules. 

In accordance with the embodiments pertaining to the production of a collection 
of single-stranded nucleic acid molecules, it is finally preferred that the 5' or 3' 
termini of said oligo- or polynucleotides representing said variable sequences 
which anneal to said 5' or 3' termini of said adaptor oligonucleotide are invariable 
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between different members of said Qligo- or polynucleotides representing said 
variable sequences. 

In another preferred embodiment of the method of the invention, ligation is 
effected with T4 DNA ligase. Most preferred is that about 1 unit of T4 DNA ligase 
is reacted in step (c) with about 4pmol of termini of the oligo- or polynucleotides 
annealed to said adaptor molecule(s). It is also preferred in this embodiment that 
the ligation reaction is carried out at a temperature of about 20 P C. Ligation 
efficiency may significantly be increased if the reaction is carried out at, for 
exampie, 37°C-the temperature optimum for T4 DNA ligase. At this temperature, 
it is required that ^complementary sequences comprise 5 or more nucleotides. 

Further preferred is a method wherein the ligation reaction is carried out in the 
presence of some molecular crowding agent, .for example, with at least 5% 
polyethylene glycol. The inclusion of polyethylene glycol above the indicated 
range is advantageous because it increases the ligation efficiency. 

In accordance with this preferred embodiment, it is more preferred that the 
ligation reaction is carried out in the presence of 12 to 18% polyethylene glycol. It 
is particularly preferred that the ligation reaction is carried out in the presence of 
about 15% polyethylene glycol. 

Preferred in accordance with the method of the invention is further that said 
polyethylene glycol is polyethylene glycol 6000. 

In an additional preferred embodiment of the present invention, the method 
further comprises the step of purifying said single-stranded nucleic acid 
molecules. Purification can be performed according to standard protocols, see for 
example Sambrook, J., D. Russell. 2001. Molecular cloning: A laboratory manual. 
Cold Spring Harbor Laboratory, Cold Spring Harbor, NY. . 
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Purification advantageously includes PAGE (Polyacrylamide Gel 
Electrophoresis) preferably under denaturing conditions, FPLC or HPLC or 
chromatography. Also preferred is an embodiment of the method of the invention 
further comprising modifying at least one of said oligo- or polynucleotides. In the 
alternative of the aforementioned embodiment, at least one of said oligo- or 
polynucleotides is modified when added to the reaction, i.e. the first step of the 
method of the invention. 

In accordance with this preferred embodiment of the invention,. the modification 
of the at least one oligd- or polynucleotide may be effected during one step of the 
method of the invention, for example when performing the fill-in reaction. 
Alternatively, a pre-modified oligonucleotide or polynucleotide may be included in 
the steps of the method of the invention. Modifications may be manifold and 
include the modifications recited herein below as being preferred. 

Advantageously, the modification is a ribonucleotide, a spacer or a nucleotide 
comprising a detectable label. 

. Detectable labels include bioluminescent, phosphorescent, biotinilated, 
fluorescent and radioactive labels such as labels with 32 P or 3 H. 

In a particularly advantageous embodiment of the method, said oligo- or 
polynucleotides representing the invariable sequence are modified. 

It is also preferred to automate the method of the invention. The ligation reaction 
may be assembled by . liquid handling automated system. It is additionally 
preferred that the final product is purified by HPLC, 

In an additional preferred embodiment of the method of the present 
invention, said method further comprises employing members of said collection 
of nucleic acid molecules in ligase-independent cloning (LIC). Composite primers 
for UC have gene-specific 3'-parts and special 5'-parts (LIC system, Novagen; 
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In-Fusion PCR cloning, BD Biosciences Clontech; Gateway PCR cloning system, 
Invitrogen). 



The figures show: 

Figure 1 Applications of oligonucleotides with block structure. Gene-specific 
parts are white, common parts are black, A. Template-dependent ligation of 
padlock probe. B. Primers for multiplex PCR amplification and ligation- 
indepehdent cloning. C. Invader assay. 

Figure 2: Padlock synthesis. A, Step by step PAGE analysis of padlock 
synthesis. Bands were visualized by UV shadowing as described in Methods. 
Lane 1 - unligated primers; lane 2 - result of ligation; lane 3 - PAGE-purified 
padlock probe. Aliquots of the same reactions were taken for this gel. B. Scheme 
of ligation. C. PCR-based approach for padlock synthesis (Antson et sd. 2000; 
Myer and Day 2001). Amplification is performed with two gene-specific primers 
having long 3' overhands (#PCR_ L and #PCR_R) on template #PCR_C. Single 
stranded padlock is purified after annealing to Streptavidine paramagnetic 
particles. 

Figure 3: Ligation of different primers with the same 4nt overhangs. Ligation of 
[^32P]ATP labeled #top and #bot primers with (#1; #a1) and (#2; #a2); see 
scheme under Table 1 . Ligation was performed as described in Methods. Lanes 
1-8: #bot primer; lanes 9-16: #top primer. Lanes (1-7) and (9-15) correspond to 
the sequential two times dilutions of T4 DNA ligase (400u for lanes 1 and 9). 
Lanes 8 and 16 - control without ligase. A. Ligation with #1 and #a1. B. Ligation 
with #2 and #a2. 

Figure 4: Application of ligated primers. A. Padlock probe circularizes only in the 
presence of perfectly matched template. Circular products have decreased 
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mobility in PAGE comparing with linear Ones. Lane 1 --control without ligase; 2 — 
ligation on #T1 (perfectly matched) template; 3 - ligation on #T2 (mismatched) 
template; B. PCR amplification of ORF of phi29 polymerase (1,7kb) with 
composite and gehe-speoific primers. Lines 1-5 - composite primers (#1&#top 
and #2&#b.of); lines 6-10. •* gehe^specjfic primers (#top and #bot). Lanes 1 and 6 
- after 12 cycles; lanes 2 and 7 - after 16 cycles; lanes 3 arid 8 - after 20 Cycles; 
lanes 4 arid 9 ^ after 24 cycles; lanes 5 and 10 - after 28 cycles. M - marker. 



MATERIALS AND METHODS 

Oligonucleotides were synthesized by TIB Molbiol (Berlin, Germany). Primer 
sequences are given in Table 1. T4 DNA ligase arid T4 PNK were from New 
England BiOLabs (Beverly, U&A). Tfh ligase was from ABQene (UK). 

Polyacrylamide gels with radiolabeled oligonucleotides were, exposed on a Fuji 
lrriagjng Plate without any fixation. For long exposition (rnore than couple of 
hours) cassette With gel wa$ f feezed at -2Q°6. Freezing prevents diftuslOri of 
even 4nt long Oligonucleotides. 

Padlock probes. 

Phosphorylation of #0 and #L oligonucleotides was performed at 37 6 C fo.t 1 hour. 
1hmol of primer was Incubated in 10>l of T4 PNK buffer (TrisHOl, pH 7.6 70mM; 
MgCfe 1.0mM; PTT 5mM) with 1 nriM ATP arid %M of T4 PNK followed by enzyme 
heat inaofiVatiOri at 65 6 0 for 20 miriutes. Phosphorylated primers Were us$d in 
ligation reactions without purification. 
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The scheme of the ligation-based synthesis of padlock probe is shown on Figure 
2B. 200pmol-scale ligation reaction was performed for 1 hour at 20°C in 20jxl of 
mixture: 1xT4 ligase buffer (TrisHCI, pH 7.5 50mM; MgCI 2 10'mM; DTT 10mM;. 
ATP 1mM; BSA 25^ml); PEG 6000 15%; T4 DNA ligase 100u. To ensure, that 
all #C oligonucleotides are consumed in annealing and ligation reactions, 
adaptors and locus-specific primers were taken in a slight excess relative to the 
common primer: #C - 200pmol; #aR=#aL - 220pmpl; #R=#L - 240pmol 
(1:1,1:1,2). In some experiments adaptors were annealed to the common primer 
before, addition of enzyme and locus-specific primers (heating the mixture to 
90°C and cooling gradually to normal temperature), but this measure is not 
essential for the method of the invention. 

Padlock probes may be phpsphorylated directly in the ligation mixture after heat 
inactivation of T4 DNA ligase (e. g. 65°C for 15 minutes). 

Padlocks were purified through denaturing PAGE electrophoresis. The 
corresponding band was visualized by UV shadowing on the DC Alufolien 
Kiselgel 60F254 (Merck, Germany) chromatographic plate (or on printer paper 
with a somewhat lower, sensitivity) and was cut out. DNA was ethanol 
precipitated after elution from the gel in 150jxl of (TrisHCI, pH 7.5 10mM; EDTA 
1 mM; NaCI 200mM) for 1 hour at 60°C. 

Circularjzation of 40fmol of a [^32P]ATP labeled padlock probe on 2fmol of 
matched or mismatched synthetic template was performed in 10pJ of 1xTth 
ligation buffer (TrisHCI, pH 8.3 20mM; MgCI 2 10mM; KCI 50m.M; EDTA 1mM; 
NAD + ImM; DTT 10mM; Triton X-100 0,1%) by 1u of Tth ligase for 20 cycles of 
(94°G for 20 sec and 60°C for 3 min). 



Composite primers for PCR. 
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Ligation was performed separately for (#top, #1, #a1) and (#bot, #2, #a2) primer 
sets: 1 hour at 20°C and 20 min at 70°C in 10|xl of mixture: 1xT4 ligase buffer; 
PEG 6000 15%; T4 DNA ligase 400u; phpsphprylated #top pr #bot primers - 
20pmol; #a1 or #a2 - 25pmol; #1 or #2 - 30pmol. Composite primers were used 
in PCR amplification without any purification. 

Amplification was performed in 50uJ with Advantage cDNA polymerase Mix 
(Clontech, USA). 1uJ of phage phi29 suspension (5x1 0 10 1/ml; DSMZ, Germany) 
was used as a template. Parameters of PCR with #top and #bot primers were: 
10pmol of both primers; 96°C 2min (95°C 20sec, 62°C 20seC, 68°C 
1min20sec)x28 cycles. PCR with composite primers: 1pmol of both composite 
primers, 10pmol of #1 and #2 primers; 96°C 2min, (95 G C 20sec, 62°C 20sec, 
68°C 1min20sec)x5 cycles, (95°C 20sec, 58°C 20sec, 68°C 1min20sec)x23 
cycles. Two different annealing temperatures were used for amplification with 
composite primers because melting temperature of external primer #1 (59 ; 6°C) 
was less, than that of internal primers #top and #bot (65°C). 

In. this specification, a number of documents is cited. The disclosure content of 
these documents including manufacturers' manuals, is herewith incorpbrated by 
reference in its entirety. 

the examples illustrate the invention. 
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Example 1 : Padlock synthesis. 

Padlock probes (padlocks) are typically 90-1 20nt oligonucleotides, which consist 
of two locus-specific regions on both 3' and 5' ends connected by universal linker 
part (Figure 1A). The ability of padlocks for template-dependent circularization is 
used for in situ localization and SNP detection (Antson et al. 2000; Lizardi et al. 
1998; Myer and Day 2001; Pickering et al. 2002). High quality of locus-specific 
ends is important in ligation reaction, because they should create a nick with 
perfect base pairing. 

The scheme of the ligation-based synthesis of padlocks is shown on Figure 2B. 
Adaptor primers #aL and #aR and central part primer #C (all shown black) are 
common for the whole set of padlocks. Primers #R and #L (shown white) are 
locus-specific. 5nt 3' and 5' overhangs of adaptor primers serve for ligation of 
locus-specific primers. Small excess of adapters (1,1x)'and locus-specific 
primers (1 ,2x) guarantee that all #C oligonucleotides will be consumed in ligation. 
The scheme is practically the same for synthesis of oligos with common 3' or 5' 
regions (just one adaptor and one locus-specific primer instead of two). 

Melting temperatures of overlaps of adaptor primers with common primer are 
about 50 °C in the ligation mixture. In principle, it is possible to use shorter #C- 
complementary segments of adaptor primers thus decreasing the price of the 
procedure. Overhangs for locus-specific primers were selected to be 5nt long 
(see scheme below Table 1). Such a length was satisfactory for the purpose of 
.the present invention. However, longer overhangs showed better ligation 
efficiency (data not presented). Moreover, for longer overhangs it is possible to 
increase the ligation efficiency to about twice by carrying out the reaction at 37°C 
- i.e. the optimal temperature for T4 DNA ligase. 

The method of the invention requires attention to the selection of primer ends 
(termini) to avoid misligation. In accordance with the present invention it was 
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found that due to the high fidelity of T4 DNA ligase this restriction is not stringent. 
To estimate the efficiency of mismatch ligation, we have compared ligation of 3' 
overhangs (3'-CTCGG...5') with perfectly matched primers/oligonucleotides (5'- 
...GAGCC-3') and with two mismatched primers/oligonucleotides: (5-...GAGga- 
3') and (5 , -...GAGCg-3'). In experiments with ten times excess of ligase we have 
not detected any ligation products for overhangs containing one and two 
mismatched bases. No problems with misligation were observed for overhangs 
shown in Table 1. It is possible to improve the ligation accuracy by increasing the 
concentration of monovalent iqns in reaction buffer (Wu and Wallace 1989), and 
to exclude any participation of adaptor pligos in ligation by blocking their 3' ends. 

According to the present New England BioLabs Catalogue, 1 unit of T4 DNA 
ligase is capable to join about 1,2 pmpl of nicks of A/Hind III digested DNA in 1 
hour. In a test ligation of #C with #R and #aR we have obtained a similar result: 
1u - 0,4pmol. Because PEG 6000 increases the ligation efficiency (Pheiffer and 
Zimmerman 1983), we have checked the influence of 5% - 20% PEG 6000 on 
the reaction. 15% PEG turned out to be the most effective, increasing the ligation 
efficiency about 15 times: 1u - 6pmol. For padlock synthesis, 0.5 unit of 14 DNA 
ligase should be added per 1 pmol of the #C primer. This ratio was determined in 
, a series of small-scale ligations with decreasing amounts of the enzyme (data not 
shown) and agrees well with titration on individual nicks. 

Padlock probes were purified in denaturing PAGE (Figure 2A). The yield is 
quantitative: 70% according to spectrophotometer measurements and close to 
100% according to PAGE (Figure 2A). It seems that some UV adsorbing 
impurities are removed on this step. Purified padlock probes run as single bands 
in denaturing PAGE (Figure 2A) and are suitable for SNP-discriminating ligation 
(Figure 4A). PAGE purification is the most time-consuming and laborious step of 
the procedure. However, in conventional phosphoramidite synthesis this step 
also cannot be avoided (see below). Besides, in ligation-based procedures 
distinct differences between the length of padlock and initial primers (Figure 2A) 
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allows to use HPLC purification instead of PAGE. Adaptor primers do not 
participate in ligation and may be repurified together with padlocks in large-scale 
projects. 

Ligation-based procedure is superior if compared with the conventional 
phosphoramidite synthesis. 

Length restriction. Oligonucleotide-producing companies cannot guarantee a 
quantitative yield for pligos longer than 100nt and do not take orders for primers 
longer than 130nt. In contrast, the Ijgation-based method of the present invention 
is only restricted by the synthesis of individual components: for 3-component 
padlock probe the procedure is practical up to 150-180nt. 

Price and quality. According to information from all three companies listed in 
Table 2, a two-step purification (HPLC and PAGE) is required for 80-1 OOnt long 
oligonucleotides. HPLC alone cannot separate full-length oligonucleotides from 
nearby contaminants. Smallest-scale synthesis of one 100nt oligonucleotide 
costs about 170 Euro and gives about 3 nmol of product after HPLC purification 
(Table 2A). Judging from Operon the yield after PAGE purification will be less 
than 1 nmol. 

5 nmol - scale synthesis of single padlock oligonucleotide by ligation method 
costs 120 Euro (Table 2B). In our hands the yield after PAGE purification is more 
then 3 nmol. Comparing "170 Euro per 1 nmol" with "120 Euro per 3 nmol", the 
ligation-based synthesis of the present invention is four times cheaper. What is 
more important, the quality of ligase-based synthesis is higher. It provides bands 
practically without "n-r contaminants. In principle, HPLC purification may be 
used instead of PAGE. 

Synthesis of large sets. For producing a set of oligonucleotides with block 
structure by the phoshoramidite technology, common regions have to be 
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synthesized again and again. The price per oligo does not change if compared 
with the single synthesis. For example, synthesis of 40 100nt padlocks for 20 
SNP loci (two padlocks per loci) by the conventional procedure costs 6720 Euro 
(Table 2A). In the Ijgatio.n-based method the price per oligonucleotide drops 
dramatically, because common parts need to be synthesized only once. The 
same set costs 1846 Euro. The price difference is 10 fold (6720 Euro, 1 nmol of 
qach padlock; 1846 Egro - 3 nmol). 

If commpn parts contain biotin or fluorescein, conventional synthesis will be 2000 
Euro more expensive (40 padlocks x 50 Euro) and the yield drops about two fold. 
Ligation-based synthesis will cost 100 Euro more (2 padlocks x 50 Euro) and the 
yield remains the same. 

Two PCR based methods of padlock synthesis were suggested recently (Antson 
et al. 2000; Myer and Day 2001; and Figure 2C), but they have some 
disadvantages if compared with proposed ligation-based technology. Locus- 
specific primers have long overhangs for PGR initiation (~12nt (Antson et al. 
2000) and ~18nt (Myer and Day 2001)). It is impossible to insert modified bases 
in some definite position during PCR. Using of proofreading polymerase (Antson 
et al. 2000) results in heterogeneous 3* ends of padlocks. Finally, single-strand 
purification by streptavidine PMP's is expensive (about 1m! of Dynabeads is 
required for 500pmol of padlock). 

To summarise, the ligation-based technology in accordance with the present 
invention permits to synthesize parts of composite oligonucleotides in separate 
reactions of the appropriate scale. Waste is minimal. Adaptor primers may be 
reused. Synthesis may be automated, because it is compatible with HPLC-based 
purification. For large sets of composite oligonucleotides the price per one primer 
tends almost equals the price of locus-specific parts and becomes comparable 
with that of 40-50mers, which are widely used now in molecular biology. 
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Example 2: Composite primers for PCR amplification. 

Composite primers with gene-specific 3' regions (Figure 1B) are used for 
multiplex PCR gmpiification (Favis et al. 2000; Undblad-Toh et al. 2000;' 
Eurogenetics -r- genome primer sets), ligase-independent cloning (LIC) and 
Invader assay (Mein et al. 2000). They have "block structure": 3' parts (17-25nt) 
identify gene target and 5' regions define some particular application: second- 
stage PCR (16-20nt), concrete LIC scheme (LIC system, Novagen - 15nt; In- 
Fusion PCR cloning, BP Biosciences Clontech - 16nt; Gateway PCR cloning 
system, Invitrogen - 29rit) and so on. 

Some applications require combination of blocks. For example, the same gene- 
specific part should be attached to different vector-specific parts for optimization 
of the protein expression (Dieckman et al. 2002). Frequently, the required 
amount of primers is very small. Only one successful PCR reaction is necessary 
for LIC or for preparation of genome-sequencing tags (Eurogenetics: 
http^/www.eurogentec.com/code/en/geno_dnaajDrpd.htm). 

Each composite primer should be synthesized individually by conventional 
phosphoroamidite technology. Moreover, decreasing of the synthesis scale does 
not decrease the price of the primers proportionally. Some LIC methods require 
insertion of modified bases in composite oligonucleotides, for example 4-5 
dUracils (Rashtchian et al. 1992) or 2-3 phosphorothioate bonds (de Costa and 
Tanuri 1998; Zhou and Hatahet 1995). in thjs case the price of composite oligos 
increases in 2-3 times. 

PCR amplification is more difficult with long composite primers. Sometimes, the 
only way to obtain single-band product is to use cloned or preliminarily amplified 
fragments as a template. This means that two sets of primers should be used in 
this case: gene-specific pair and composite pair. 
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It is very convenient to prepare small amounts of composite primers by attaching 
presynthesized blocks to each other. In this case it is possible to combine gene- 
specific parts with different 5' parts. 

Small-scale procedure for preparation of composite primers from individual 
blocks with the help of T4 RNA ligase was described previously (Kaluz et al. 
1995). This enzyme needs no overhangs for joining of two pligps. The only 
disadvantage of the method is that some part of the reaction products (depends 
on excess of external primer) have duplication of gene-specific 3' part It is not 
critical for some applications, but can provide problems for cloning. Another 
disadvantage is that, phosphorylation and ligation should be performed 
separately. 

In contrast to the above recited prior art approaches, the (T4 DNA) ligase based 
method of the present invention requires overhangs for ligation, but guarantees 
the homogeneity of the product.. Ligated primers may be. used in PGR without 
any purification because PEG and other components of ligation mixture do not 
inhibit PCR. Ligation and phosphorilation may be performed simultaneously in a 
ligation buffer (result not shown). 

Adaptor primers do not interfere with PCR. To decrease the risk of false priming 
by adaptor, primers and mis-annealing of long composite primers it is most 
appropriate to use a mixture of external and composite primers (in molar ratio 
10:1) instead of composite primers alone in an amplification reaction. 

An example of amplification with composite primers produced by ligation is 
shown in Figure 4B. The ORF of phi 29 polymerase (Blanco and Salas 1996) 
was amplified from 1uJ of phage culture. In both cases the. required products 
were obtained. PCR was slower with composite primers if compare with gene 
specific ones. Worse kinetics at least partly depends on external primers, 
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because the amplification is more effective with (#top and #bot) pair than with (#1 
and #2) pair (results not shown). 

There are two possible schemes of design of adaptor primers for ligation. One 
approach is to order individual adaptor for each particular combination of 5' and 
3' parts. Adaptors should be 10-14nt long (two 5-7nt overlaps). They are cheaper 
than long composite primers. The price of external primers should be left out of 
account, because they may be used with a number of gene-specific primers. A 
20pmol aliquot of HPLC purified external primer costs less then 10 Euro-cent 

Another approach is to use the standard adaptors and to prepare gene-specific 
primers with predefined overhangs (as in the method for padlock preparation, 
see above). Long overhangs may conflict with some cloning schemes, but too 
short overhangs may be a bar for the effective ligation. G/C-reach 5nt overhangs 
(5'-...GAGCC-3') and (5'-ACGGG...-3') are sufficient for effective ligation (see 
padlock preparation above). To obtain an idea about the suitability of shorter 
sequences we have tried to use the (5'-TATG...-3') sequence for the preparation 
of composite primers. This sequence was selected because ATG may be 
combined with the first Met codon of the open reading frame. It turns out, that 4nt 
A/T-rich overhang requires much more Hgase. Moreover, different amounts of 
enzyme were necessary for different combinations of primers (Figure 3). 
Nevertheless, 1jxl (400u) of T4 ligase was enough to prepare 20pmol of 
composite primers. It costs about 1 Euro, much less, if compared with the price 
of long composite primers. 

Conclusion. 

Ligation-based technology is based on construction of composite 
oligonucleotides from individual blocks. Here we have demonstrated, how this 
method may be applied for two different tasks: 

(i) preparative-scale production of padlock probes; 
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(ii) small-scale synthesis of PCR primers. 

Preparative-scale procedure gives products of high quality. In contrast to the 
conventional phosphoroamidite synthesis, HPLC may be used instead of PAGE 
purification. 

Small-scale procedure is fast one-tube reaction. It does not require any 
purification and may be automated. 

In both cases the suggested method is considerably cheaper if compared with 
the conventional phosphoramidite synthesis. The price difference increases at 
least two fold, when oligonucleotides contain modified bases. 
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TABLES 

Table 1. Oligonucleotide sequences. 

SNP positions in template primers (T1 and T2) are in bold. Alignments for 
ligation-based synthesis of padlock and PCR primers are shown under the table. 
Overhangs for ligation are in bold. 



#c 


GGAGGTTGCGAGGCGTATTCATTGCTCAGAATTCACGACTCACG 


#aR 


CCTCGCAACCTCCGGCTC 


#al_ 


CCCGTCGTGAGTCGTGAATT 


#R 


TTGTAAAACGTCGG G AG AAACAGAGAGCC 


#L 


ACGGGACATTTAAGACCAAACTG 


#T1 


CICICIGI I ICICCCGACGI I I IACAACAGI I IGGICI I AAA I G I I CGCCGC 


#T2 


ICICIGM ICICCCGACGI I I IACAATAG I I IGGICI IAAAIGI ICGCCG 




#top 


TATGAAGCATATGCCGAGAAAGATG 


#bot 


. TATGTTTGATTGTGAATGTGTCATCAAC 


#1 


HUGH IAACI I I AAGAAGGAGA I A I ACA 


#2 


GATCCTCAGTGGTGGTGGTGGTGGTGCA 


#a1 


CATATGTATATCTCCTTCTTA 


#a2 


CATATGCACCACCA 
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Table 2. Synthesis of IQOnt primer. 
(A) Conventional method. 



Company 


Synthesis 
scale 


Price 
per base 


Purification 


Guaranteed 
yield and price 


Operon 


1umol 


2.05 
Euro 


HPLC&PAGE 
(11$ Euro) 


0.5nmol 
HPLG&PAGE 
323 Euro 


MWG 


•>0.2 nmol 


1.78 
Euro 


HPSF included 


3 nmol 
HPSF* 
178 Euro 


TIB 


>0.2umol 


1.68 
Euro 


HPLC included 


3 nmol 
HPLC* 
168 Euro 


* - b ol 


th companies recommend to perform additional PAGE purification in 



laboratory.. 

Operon: QIAGEN Operon GmbH, Cologne, Germany; 
MWG: MWG Biotech, Ebersberg, Germany 
TIB: TIB MOLBIOL, Berlin, Germany 



(B) Ligation-based method 

(prices are given according to TIB). 



Component 


Scale and purification 


Guaranteed yield and 
price 


#L,#R 


up to 30 bases long, HPLC purified 


5 nmol; 39.08 Euro 


#aL, #aP> 


up to 20 bases long, standard 
purification 


5 nmol; 19.98 Euro 


#C 


up to 60 bases long, standard 
purification 


5 nmol; 55.00 Euro 


T4 DNA 
ligase 


1250u 


3.25 Euro . 


T4 PNK 


25u 


2.50 Euro 




Total: 




5nmol 
120 Euro 



WO 2004/092375 



PCT/EP2004/003921 



29 



Table 3. Ligation-based synthesis of 40 100nt padlocks for 20 SNP loci 

(two padlocks per one locus). 



Component 


Scale and purification 


Price per 20 loci 

(Euro) 


#L1,#L2,#R 


5 nmol of each, HPLC purified 


1172 


#al_ and #aR 


500 nmol of each *, standard 
purification 


114 


#C1,#C2 


250 nmol of each *, standard 
purification 


330 


T4 DNA ligase 


50000u 


130 


T4 PNK 


1000u 


100 


* Two times more, than is required for 40 padlocks. 


Total: 




1846 Euro 


Per one 
padlock: 


5nmol 


46.2 Euro 
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CLAIMS 

1. A method of producing single-stranded nucleic acid molecules from oligo- 
or polynucleotides wherein each of said oligo- or polynucleotides has a 
predefined 5' or 3' terminus, comprising the steps of 

(a) annealing an adaptor oligonucleotide simultaneously or step by 
step to 

(aa) a first oligo- or polynucleotide; and 

(ab) a second oligo- or polynucleotide 

wherein the 5'-terminus of said adaptor oligonucleotide is 
complementary in sequence to the 5' terminus of said first oligo- or 
polynucleotide and the 3'-terminus of said adaptor molecule is 
complementary in sequence to the 3' terminus of said second oligo- 
or polynucleotide; and optionally 
(a f ) simultaneously with or subsequently to step (a) annealing at least 
one further adaptor oligonucleotide to free termini of said first or 
second oligonucleotides and to free termini of further oligo- or 
polynucleotides; 

(b) optionally filling in gaps between the neighbouring ends of said oligo- 
or polynucleotides; 

(c) ligating said oligo- or polynucleotides; and 

(d) removing said at least one adaptor oligonucleotide. 

2. The method of claim 1 wherein the complementarity in sequence is at 
least four nucleotides. 

3. The method of claim 1 or 2 wherein annealing and ligation are 
. simultaneously performed. 

4. The method of any one of claims 1 to 3 wherein the adaptor 
oligonucleotide(s) in step (a) and/or (a') is/are provided in molar excess 
over the first or second or further oligo- or polynucleotides. 
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5. The method of any one of claims 1 to 4 wherein said single-stranded 
nucleic acid molecules represent a collection of nucleic acid molecules and 
wherein either said first or said second oligo- or polynucleotide is invariable 
in sequence between all members of said collection* of nucleic acid 
molecules. 

6. The method of claim 5 wherein said first or said second oligo- qr 
polynucleotide which is not invariable is variable in sequence between 
different members of said collection of nucleic acid molecules. 

7. The method -of daim 5 or 6 wherein the further oligo-, or polynucleotides 
are variable in sequence between different members of said collection of 
nucleic acid molecules. 

8. The method of any one of claims 5 to 7 wherein the oligo- or 
polynucleotides representing said variable sequences are provided in 
molar excess over the nucleic acid molecule representing said invariable 
sequences. 

9. The method of any one of claims 5 to 8 wherein the 5' or 3' termini of said 
oligo- or polynucleotides representing said variable sequences which 
anneal to said 5' or 3' termini of said adaptor oligonucleotide are invariable 
between different members of said oligo- or polynucleotides representing 
said variable sequences. 

10. The method of any one of claims 1 to 9 where ligation is effected with 
T4/DNA ligase. 

11. The method of any one of claims 1 to 10 wherein the ligation reaction is 
. carried out in the presence of at least 5% polyethylene glycol. 



12. The method of claim 8 wherein the ligation reaction is carried but in the 
presence of about 15% polyethylene glycol. 
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13. The method of claim 11 or 12 wherein said polyethylene glycol is 
polyethylene glycol 6000. 

14. The methpd of any one of claims 1 to 1 1 wherein about 1 unit of T4/DNA 
ligase is reacted in step (c) with about 4 pmol of termini of the oligo- or 
polynucleotides gnnealed to said adaptor molecule(s). 

15. The method of any one of claims 1 to 12 further comprising the step of 
purifying said single-stranded nucleic acid molecules. 

16. The method of claim 15 wherein purification includes PAGE 
electrophoresis, HPLC or chromatography. 

17. The method of any one of clgims 1 to 16 further comprising modifying at 
least one of said oligo- or polynucleotides. 

18. The method of any one of claims 1 to. 16 wherein at least one of said oligo- 
or polynucleotides is modified. 

19. The method of claim 17 or 18 wherein the modification is a ribonucleotide, 
a spacer or a nucleotide comprising a detectable label. 

20. The method of any one of claims 17 to . 19 wherein said oligo- or 
polynucleotides representing the invariable sequence are modified. 

21. The method of any one of claims 5 to 20 further comprising employing 
members of said collection of nucleic acid molecules in the determination 
of SNPs in vitro. 

22. The method of any one of claims 5 to '20 further comprising employing 
members of said collection of nucleic acid molecules in ligase-independent 
cloning or two-step PGR. • 
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